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Abstract 

We analyze an evolving network model of Krapivsky and Redner in which new 
nodes arrive sequentially, each connecting to a previously existing node b with prob- 
ability proportional to the p-th power of the in-degree of b. We restrict to the 
super- linear case p > 1. When 1 + ^ < p < 1 + r^j- the structure of the final 
countable tree is determined. There is a finite tree T with distinguished v (which 
has a limiting distribution) on which is "glued" a specific infinite tree, v has an 
infinite number of children, an infinite number of which have k — 1 children, and 
there are only a finite number of nodes (possibly only v) with k or more children. 
Our basic technique is to embed the discrete process in a continuous time process 
using exponential random variables, a technique that has previously been employed 
in the study of balls-in-bins processes with feedback. 

1 Introduction 

In some important examples of growing networks, such as the World Wide Web 
or the scientific citation network, one can interpret the fact that a given node has 
high in-degree as indicative that node is "popular". For instance, popular papers 
are the ones more often cited more by other works, and popular Web pages receive 
more links than less popular ones. A consequence of differences in popularity is 
that a node with high in-degree has more propensity to receive further edges as the 
network evolves than an unpopular node with low in-degree. In other words, the 
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more popular a node is, the more visible it is to the community that creates the 
network and/or interacts through it, and high visibility makes future increases in 
popularity more likely. 

Barabasi and Albert [I] incorporated this so-called preferential attachment phe- 
nomenon into a generative model for these and other networks. In this model, 
nodes arrive at the network one at a time, and direct a fixed number m of edges 
to previously existing nodes that are chosen with probabilities proportional to their 
in-degrees. It is quite remarkable that this simple model already replicates many 
non-trivial features of the above networks, such as power-law degree distributions, 
small diameter and high resistance to random failures, as argued non-rigorously by 
physicists (see |2J ^ and references therein) and later proven rigorously by mathe- 
maticians [iniEiiniEi- 

The success of the Barabasi- Albert model has also inspired many different vari- 
ants. The models in |12| Ej permit that the power-law exponent of the degree 
distribution be adjusted to fit real-world data. Other models [3] feature preferential 
attachment that is dictated both by node fitness and popularity. This work is ded- 
icated to yet another kind of variant of the model of [3] , one in which the strength 
of preferential attachment can be varied. 

This model was proposed and studied by Krapivsky and Redner JH] and in- 
dependently by Drinea, Mitzenmacher and Enachescu |15| . It differs from the 
Barabasi-Albert network in that each incoming node chooses a pre-existing ver- 
tex to link to with probability proportional to a fixed function / (the attachment 
kernel) of the degree of that vertex 1 . While the Barabasi-Albert model is recovered 
by setting f(x) = x, we will be mostly concerned with kernels of the form f(x) ~ x p 
with p > 1 thought of as a tunable parameter; this is referred to in ^H] as the super- 
linear case. One of the many remarkable non-rigorous results about this so-called 
GN (Growing Network) model is that it undergoes an infinite sequence of connec- 
tivity transitions at p = pk = 1 + 1/fc, k = 1, 2, 3, ... . By this it is meant that for 
p > Pk the GN process has only finitely many vertices that receive more than k links, 
whereas for p < pk the number of such vertices is infinite. Another way of stating 
this property is the following: the smallest integer k for which p > p^ = 1 + 1/k is 
also the smallest number k for which only finitely many nodes ever reach in-degree k. 

The connectivity transitions are both mathematically intriguing and physically 
interesting. The fact that the p > 1, p = 1 and (conjecturally) p < 1 cases of 

^As in the original Barabasi-Albert model, One could also consider a similar model in which each 
incoming node creates a fixed number m of new edges, but we will only consider the case m = 1 in this 
paper 
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the model are very different leads the authors of 14 to suggest that so-called self- 
organized criticality is at work in networks with power-law degree distributions (the 
p = 1 case). It was also noted elsewhere that the condensation regime of the 
fitness model of Bianconi and Barabasi [3] has qualitatively similar behavior to the 
super- linear GN; [Q even suggests that a direct connection between the two models 
could exist. There is also some modelling interest in the connectivity transitions, 
since networks in which preferential attachment is very strong (conceivably even 
some parts of the World-Wide Web) should exhibit behavior that is qualitatively 
similar to the GN model in the super- linear regime. 

Despite the striking characteristics, we do not know of any rigorous work on the 
GN model to the present date. A modified model was addressed in independent 
work by Chung, Handjani and Jungreis In their process, an attachment kernel 
is still present, but at each time step either a new vertex and a new edge are added 
with probability < q < 1, or only a new edge is added with probability 1 — q. This 
modified model does exhibit connectivity transitions in the sense of jlSj . but it is 
not clear how to deduce the analogous results for the original GN model from the 
techniques in [TT] . 

In this paper we attempt to give a rigorous description of the super- linear GN 
process in the large-time limit. Our rigorous results imply the existence of connec- 
tivity transitions, but they also go beyond that. The first result we prove is the 
following. 

Theorem 1.1. Let {T m } m >i be the GN process with attachment kernel f(x) = 
(x + l) p (defined in Section^). Also let be the increasing limit of the {T m } m >i 
process, and assume that p > pk = 1 + 1/k. Then with probability 1 all but finitely 
many nodes of have less than k descendants (cf. the definition in Subsec- 

tion \KW - 

A vertex of T^ with in-degree larger than or equal to k necessarily has at least 
k descendants. For this reason, Theorem 11.11 implies that for p > pk, only finitely 
many vertices of T^ have in-degree > k. As a result, the number of vertices in T m 
with in-degree bigger than k is bounded as m — > +oo. This shows that Theorem ll.il 
implies the non-rigorous "p > p^ result of 18 ], and is in fact stronger than it. 
Similarly, Theorem 11.21 below implies the p < pk case of Krapivsky and Redner's 
result. 

Theorem 1.2. Let f, p > 1 and be as in Theorem and let k = k p be 

the smallest positive integer for which p > pk = 1 + 1/k. Consider the construction 
Glue(T, v, k) defined in Subsection \2. 6 A Then the set of values (up to isomorphism) 
that Too attains with positive probability is precisely the set of all trees that can be 
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obtained by choosing a finite rooted tree T, a distinguished vertex v G T and setting 
= Glue(7>,A;). 

Theorem 11.21 completely describes (up to isomorphisms) the limit set of the GN 
process in the large-time limit. In particular, it also implies that if p < p^, the num- 
ber of vertices of in-degree > k in T m diverges as m — > +00. This differs from the 
original claim in |18j . in which the authors argue that the expectation of the number 
of vertices of degree I > k diverges at a certain rate. While we have nothing to say 
about this rate, Theorem ll.2l is stronger than the claim of ^| in that divergence of 
the expected number is implied, but does not imply, almost sure divergence. More- 
over, our description of the structure of Too is new. Finally, we note that there is 
nothing special about the choice of f(x) = (x + l) p as our superlinear kernel. In 
fact, the proof of both theorems will make it clear that it suffices to assume that 
f(x) > for all x and that f(x) = (x p ) for x S> 1, with only minor modifications 
in our arguments. 

We now briefly outline our proof techniques. On a high level, we rely strongly 
on the similarity pointed out by Drinea, Frieze and Mitzenmacher ^S] between the 
GN process and balls-in-bins models with feedback. The latter model describes the 
evolution of a system with a fixed number of bins at which balls are thrown. A 
ball arrives at each discrete time step and chooses a bin to go into with probability 
proportional to a fixed function / (that we call the feedback function) of the number 
of balls currently in that bin. This model can also be viewed as a static variant of 
the GN process in which new edges are repeatedly added but without the creation 
of any new nodes/bins . This analogy permits that a certain technique applied to 
the study of balls-in-bins problems EE HH is adapted to the GN process. It 
consists of building a continuous-time process out of exponential random variables 
and showing that it embeds the original discrete-time process. For this reason we 
call this construction the exponential embedding. The GN version of the exponential 
embedding is essential to the construction and analysis of the infinite tree limit Too , 
and we view it as an important part of our paper's contribution. 

The remainder of the paper is organized as follows. In Section |2] we introduce 
our notation and review a few basic concepts. We formally define the GN process 
in Section |3J starting with its original definition in |18L I15| , and then describing a 
useful labelled version of it. Section introduces the exponential embedding tech- 
nique. We begin with a review of the simpler balls-in-bins case, then move on to the 
construction of the embedding of the GN process for general attachment kernels. We 
then employ the embedding to show that so-called "explosive kernels" give rise to 
GN processes for which T m — > Too m finite time under the exponential embedding. 
This section ends with some lemmas on sums of exponential random variables that 
will be useful later on. Theorem 11.11 is proven in the subsequent Section [5J The 
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section starts with weaker results that intuitively pave the way for the actual proof 
of the Theorem, which relies on a careful consideration of the time of the birth of the 
kth descendant of a given node in the exponential embedding setting. In Section 
we prove Theorem ll.2| relying on Theorem 11.11 and on the techniques developed 
in the previous sections. We discuss some consequences of our main theorems and 
some related open questions in the Conclusion (Section EJ). The Appendix contains 
the proofs of some technical results. 

Acknowledgements. We thank Eleni Drinea and Michael Mitzenmacher for 
bringing this problem to our attention and for useful discussions. We also thank 
the anonymous referees for pointing out several typos and making suggestions that 
greatly improved our presentation. 

2 Preliminaries 

2.1 Probabilistic ingredients 

We briefly remind the reader of some basic probabilistic concepts and tools, while 
also fixing some notation. 

Distributions. We say that two random variables X, Y taking values on the 
same set U have the same distribution (or are identical in law) if for all measurable 
subsets A C U P (X € A) = P (Y G A). This will be symbolically represented by 
X = d Y. 

The exponential distribution. A random variable X is said to be exponentially- 
distributed with rate A > if X almost surely takes values on the positive reals and 

P (X > t) = e~ xt (t > 0) 

We denote this property by X = d exp(A). The shorthand exp(A) will also denote a 
generic exponentially-distributed random variable with rate A. We list below some 
elementary but extremely useful properties of those random variables. 

1. Lack of memory. Let X = d exp(A) and Z > be independent from X. The 
distribution of X — Z conditioned on X > Z is still equal to exp(A). 

2. Minimum property. Let {Xi = d exp(Aj)}^ 1 be independent. Then X m j n = 
mini<j< m Xi = d exp(Ai + A2 + • • • A m ). Moreover, for all 1 < i < in, 

P (Xi = X min ) = - — — — — . 

M + M + ■ ■ ■ *m 
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3. Multiplication property. If X = d exp(A) and rj > is a fixed number, r\X = d 
exp(A/r/). 

The Borel-Cantelli Lemma. Let {A n } n£ ^ be a sequence of events in some 
fixed probability space, with N a countable set. The event u A n infinitely often 
(n G N)" (or "A n i.o. (n G iV)") contains all outcomes that belong to an infinite 
number of the events A n . The Borel-Cantelli Lemma states that 

P {A n ) < +00 P (A n i.o. (n G JV)) = 

nGTV 

and 

V] P (A n ) = +cx) and {yl n } n6A r independent => F (A„ i.o. (n G iV)) = 1 . 

Discrete-time Markov Chains. A (discrete-time) Markov chain on the count- 
able set £1 is specified by transition probabilities U : X O — > [0, 1] and a initial 
condition Xq G (possibly non-deterministic). The recipe 

t 

F(V0 < i < t X 4 = Wi ) =F(X = cu )]Jn(^_ 1 ,^) 

i=l 

defines the distribution of a sequence {AQ}^ 1 of fi-valued random variables. 

2.2 Tree terminology 

Trees. All trees are rooted and have their edges directed towards the root. No 
loops or parallel edges are allowed. Given vertices a, b in a tree T, the existence of 
the oriented edge (a, b) will be indicated by saying that a is a child of b, or that b 
is a's parent, or that a links to b. With this terminology, the (in-)degree dy(6) of b 
in T is the number of its children. If r is a node of T, the subtree T r of T rooted at 
r is the tree with root r, together with r's children, the children of those children, 
and so on. The nodes in T r \{r} are referred to as the descendants of r, and r is 
said to be k-fertile in T if it has k or more descendants. Theorem 11.11 consists of 
showing that for p > Pk, only finitely many nodes in T^ are A;- fertile. 

The 'Glue' construction. Given a finite (rooted, oriented) tree T, a distin- 
guished node v of T and an integer k > 1, we define Glue(T, v, k) as follows. For 
each finite (rooted, oriented) tree S on k or less nodes, take countably many copies 
{Si}i>i of S. Glue(T, v, k) is the union of T with all the trees Si as above, with 
the addition of edges from the root of each one of the SVs to v. 
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Figure 1: An example of Glue(T, v, k) for k = 3. 

As a simple example suppose T consists of a single node (the root v ) and k = 2. 
Then in Glue(T, v , k) the root has a countably infinite number of children. In- 
finitely many of these children are childless and infinitely many of these children 
have precisely one child and none of them have more than one child. Further, all 
grandchildren of the root are childless. 

A more complex example of Glue(T, v, k), now with k = 3, is portrayed in Fig- 
ure n The starred node is v, and the finite tree T lies to the left of the dashed line. 
The countably many copies of the four rooted trees on 3 or less vertices (numbered 
1, 2, 3 and 4 in the Figure) appear to the right of the line, and are all connected to 
v by their roots. 

Our task in proving Theorem 11.21 will be to show that with probability 1 there 
exist T and v as above with T^ = GLUE(T,u,fc), and that all such Glue(T,v,/c) 
occur as values of T^ with some positive probability. 

2.3 Labelled trees and parent-closed sets 

Labels. It will be convenient for us to label the vertices of trees. For our pur- 
poses, a label is a (possibly empty) sequence of elements of the set N = {1, 2, 3, . . . } 
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of positive integers. The empty sequence is denoted by e, and all other sequences 
a = {cii}^ C N (with m > 1) will be represented by a = a\a2 ■ ■ ■ a m . Moreover, 
we call ai . . . a m _i (the sequence a without its last element) the parent sequence of 
a. The set of all labels will be denoted by N*. 

Labelling trees. A labelling of a finite tree T is an assignment of labels to 
the vertices of T that obeys two rules. 

• the label of the root of T is the empty sequence e; 

• if vertex v has degree d and is labelled by the sequence v\ . . . v m , its children 
will receive labels v\ . . . v m i, 1 < i < d. 

The second rule implies that the label of a vertex -u's parent in T is the parent 
sequence of the label of v. 

Parent-closed subsets. A subset A c N* is said to be parent-closed if it 
is non-empty and for all non-empty sequences a G A the parent sequence of a is 
also in A. Any parent-closed A corresponds to a finite tree with vertex set A and 
edges from each a G ^4\{e} to a's parent. Conversely, given a tree T, the labelling 
procedure above provides a proper set A = A(T) that corresponds to tree T. This 
set A(T) is not uniquely defined, but this will not keep us from representing finite 
trees by finite parent-closed A C N* in what follows. For this reason, we will 
often apply tree terminology to parent-closed A C N*, speaking for instance of the 
degree dA(a) of an element a £ A. We also observe that the potential descendants 
of a = ai . . . a m G N* are obtained by adjoining the terms of another sequence 
b = b± ... b n to a, thus forming the concatenation ab = a\ . . . a m b\ . . . b n . Finally, we 
define for convenience 

£fin = c N * . A p aren t- C lo S ed and finite}. 

3 Definition of the GN process 
3.1 The standard definition 

The GN[/] process is defined in terms of an attachment kernel, that is, a function 
/ : N U {0} — > IR + . The process evolves in discrete time m = 0, 1, 2, 3, . . . ; its state 
at time m > is a (rooted, oriented) tree T m = (V m ,E m ) with vertex set V m and 
edge set E m . Initially, To contains a single root node and no edges. At each time 
m > the tree is updated by the addition of a new node (V m = V m -\ U {v m }) and 
a new edge (E m = E m ^\ U {v m w m }), where w m is chosen according to the following 
probability distribution: 
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W r- T/ TTD / I rp \ /(d Tm _!(w)) 

G K m _i P (u> m = io | T m _ij - 



These definitions already specify the process completely as a finite-tree- valued Markov 
Chain. 

3.2 The labelled process 

In the labelled GN[/] process, we start by labelling the root (and unique element) of 
To by the empty sequence e. At subsequent times m > 1, assume that the incoming 
node v m links to a node w m that is labelled by the sequence a\ . . . a n , and that 
v m is the £th node to link to w m . Then the label of v m is defined to be a\ . . . a n l, 
i.e. the sequence corresponding to v m 7 s parent w m , with a new number £ added to it. 

This recursive labelling obeys the definition of a labelling of a tree given in 
Subsection 12.31 and provides an alternative description of the process as a Markov 
Chain on £ fin , as defined in Subsection 12.31 The transition probabilities of the GN 
process on £ fin are: 

U(A,B) = /(d £ Q) l, „ , if 3a € A : B = A U {a(d A (a) + 1)} (3.1) 
= otherwise 

and its initial state is To = {e}. We note in passing that the limit T^ = IJ m > T m 
of the GN process takes values in the uncountable set 

£ = {A C N* : A parent-closed and non-empty}. 

£ a closed subset of the topological space 2 N * (with the product topology). We will 
refrain from explicitly considering measurability questions related to Too and £ in 
what follows, since all such problems can be addressed in a rather straightforward 
manner. 



4 Exponential embedding 

Our aim in the present Section is to present the special construction of the labelled 
GN process that we alluded to in the Introduction. We will show how one can explic- 
itly embed the process in continuous time by employing sequences of independent 
exponential random variables. Although perhaps complicated at first sight, this 
embedding will prove to be fundamental to our analysis, with the independence of 
the involved random variables playing a key role in most of our computations. 
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4.1 The balls-in-bins case 



Davis ^3] applied the elementary properties of exponential random variables to the 
study of Reinforced Random Walks in a very interesting way. His method was later 
adapted by Khanin and Khanin |17) to the balls-in-bins setting. We present this 
latter use of exponential random variables below (which was also rediscovered by 
Spencer and Wormald [20]) as a preparation for the more difficult GN case. 



Consider independent random variables {Xj,Yj = d exp(/(j))} JgNU { } and de- 
fine, for t > 



n-1 



N(t) = sup{n <E N U {0} | ^ Xj < t}, 



i=0 
m—1 



M(t) = sup{m e N U {0} | E y i - *}• 

3=0 



We interpret the times Y17=o -^i and Sj=o as ^ ne times when iV(-) and M(-) 
receive their n-th and m-th "hits", respectively. We now fix some t > and n,m £ 
N U {0} and define the event 

{n—l n m—1 m 1 

^Xi<t< ^Xi, Y,Yj<t<Y, Y A- 
i=0 i=0 j=0 3=0 J 

What is the probability that the N(-) process is the first one to receive a hit after 
time t, conditioned on A l n ? This probability can be written as 



n 



i=0 j=0 



A 1 



If we further condition on Y17=o = s% < t and Yl^o 1 Yj = s % — we can wr it e 
this probability as 

P (X n + Sl < Y m + s 2 | X n > t - si, Y m > t - s 2 ) 

The lack-of-memory property of exponentials implies that under the conditioning 
event above X n — t + s\ = d exp(/(n)) and Y m — t + S2 = d exp(/(m)). The minimum 
property then implies 

P (X n + si < Y m + s 2 | X n > t - si, Y m > t - s 2 ) = 

f(n) 



P (exp(/(n)) + t < exp(/(m)) + t) 



f(n) + f(m) 
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Since this holds for all < s±, S2 < t, we have in fact proven that 

/(") 



i=0 j=0 



A 1 



f(n) + f(m) 



We thus arrive at a surprising conclusion. 



Fact 1 (Exponential embedding for balls-in-bins, [13j I20p. Consider the 
balls-in-bins process lib}/ with two bins and feedback function f, i.e. the discrete 
Markov Chain that evolves from state (n,m) £ (N U {0}) 2 to state (n + l,m) with 
probability ^( n |ffi( m ) and from (n,m) to (n,m + 1) with probability f(Jy+j( m ) ■ R 
then holds that the joint hit counts of the (iV(-), M(-)) processes up to the (possibly 
finite) time when either one becomes infinite is identical in law to the balls-in-bins 
process with feedback function f started from (0, 0). That is, the balls-in-bins process 
is embedded in the continuous time (iV(-), M(-)) process, with X~ (respectively Ya) 
parameterizing the time between the arrivals of the j-th and (j + l)-th balls at the 
first (resp. second) bin. 

Many non-trivial results that do not have direct combinatorial proofs can be 
deduced from the above construction. This method seems to be especially powerful 
in the case when either N or M reaches an infinite value in finite time. The reader 
is directed to |171 I19[ I20| for many examples of applications of the exponential 
embedding. We will now show how we can adapt this technique to our present 
context. 



4.2 Exponential embedding of the GN process 

As pointed out in the introduction, a balls-in-bins process with feedback function / 
is very similar to a GN process with attachment kernel / to which only new edges 
(and no new vertices) are added. Conversely, one may think of a GN process as a 
balls-in-bins process in which each new ball also creates a corresponding bin. This 
analogy was exploited in in which a variant of the original GN process was 
modelled as an "infinite Polya Urn process" for the purposes of studying the degree 
sequence. We take this analogy further by adapting the exponential embedding 
technique to the labelled GN process as defined in Subsection 13.21 

Our construction starts from an independent sequence {X(a,j) = d exp(_f(j)) | 
a G N*,j G N U {0}} of random variables. The random variable X(a, 0) shall 
correspond to the age of vertex a at the time its first child al is born. For j > 1, 
X(a,j) shall parameterize the time between the births of the j-th and (j + l)-th 
children of the a. Therefore, the sequence {X(a, j)}a,j plays a r °l e that is similar to 
that of the X^s and Yj 's in Subsection 14. II above. There is, however, one important 
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difference: whereas balls-in-bins processes always have a fixed number of bins at 
which balls/hits arrive, the number of "bins" in the GN process grows. That is, the 
potential vertices a G N* of the trees {T m } m >o do not all come into existence at 
the same time; they are rather born at appropriate times. We therefore introduce a 
notion of birth time, which is defined recursively as follows. 

• the birth time of the empty string a = e is B(e) = 0; 

• let a\, a,2, ■ ■ • , a n G N and consider the sequence a = ax ■ ■ ■ a n . The birth time 
of a is the birth time of the parent sequence b = a± . . . a n -x plus the time until 
the a n -th birth at b. More precisely, 

On — i 

B{a) = B(ax ...a n ) = B(ax . . . a n -i) + X{a\ . ..a n -x,j). 

j=0 

An equivalent form of the definition of B(a) is 

n— 1 a i+i— l 

B{a) = B(ax...a n ) = Y J £ X( ax-..ai,j). (4.1) 

i=o j=o 

Our continuous time process is defined by setting 

W(t) = {a E N* : B(a) < t} (t G R). 

W(-) always takes values in the set £ of parent-closed subsets of N* (defined in 
Subsection I2.3|) . This is because the definition of birth time implies that the birth 
time of ax ■ ■ ■ a n -x is always smaller than or equal to that of ax ■ ■ ■ a n . 

Let us now specialize to the case where / : NU{0} — > R is given by f(x) = (x+l) p 
for some constant p > 1. Such attachment kernels satisfy the explosion condition 

n>0 J K ' 

The condition implies that the expectation of 

+oo 

V(a) = J"X {a, j)= sup B(ak)-B(a) (a G N*) (4.3) 

is finite. Therefore, all the random variables defined in (|4.3|) are almost surely finite. 

Definition 4.1. For an element a G N*, the random variable V(a) defined in 

is the explosion time of a. The infimum of B(a) + V{a) over all a G N* is the tree 

explosion time, or the explosion time of the VV(-) process, and is denoted by S. 

S = inf B(a) +V(a) (4.4) 

aeN 
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The intuition behind the definition of S is that it is the first time when some 
node in the VV(-) process has an infinite number of children. In fact, we claim that 

Claim 4.2. The following events hold with probability 1. The birth times 13(a) that 
are smaller than S are pairwise distinct and can be well ordered with order type u. 
Letting = B(e) = Bq < B\ < . . . < B n < . . .denote their ordered sequence, B n /* S 
as n — > +oo. Moreover, there exists a unique v £ N* that has infinite degree in 
W(S); this v satisfies B(v) + V(v) = S and B(w) + V(w) > S for all w ^ v. 

A direct consequence of Claim FOl is Theorem 14.31 below. 

Theorem 4.3. Let {£> n } n >o be as in Claim \4^ Then the sequence {T n = W(B n )} n >o 
is identical in law to the labelled GN process. Moreover, T n — > = W(<S) as 



However, to prove Claim li~2l we will need some elements of the proof of The- 
orem ^3 This could potentially result in a problem: using Theorem 11.11 to prove 
Claim POl then employing the Claim to prove Theorem 14.31 and finally using this 
Theorem in the proof of Theorem 11.11 would not be acceptable. Instead, we circum- 
vent this difficulty as follows. 

1. In the beginning of the next Section, we state Lemma l5.1( which is the same 
as Theorem 11.11 but with W(iS) replacing T^ in the statement. 

2. Claim l4~2*l is then proven, assuming the Lemma. 

3. The remainder of the Section proves Lemma l5.1[ without assuming Claim l4~2l 
or Theorem 14.31 in any way. 

4. The argument below shows that Claim l4~2l implies Theorem l4.31 which directly 
implies that Lemma 15. II can be strengthened to Theorem ll.il 

Irrespective of formal proofs, the reader should keep in mind that W(«S) rep- 
resents the tree T m in the statements of Theorem 11.11 and Theorem 11.21 Vertices 
a € N* whose birth times satisfy B(a) > S are not really "born" in T^, but rather 
constitute a fictitious continuation of T^ in which new vertices continue to arrive 
even though infinitely many vertices have already appeared. We will use this con- 
tinuation to our advantage in many of the proofs below. 

Proof: [of Theorem l4.3j Assuming Claim l4~2*l it suffices to show that for all A € £ fin 
and all a E A 

P (first birth of VV(-) after time £ is at a \ W(t) = A) 



n 



+oo. 




mmB(bd A (b)) + X(b,d A (b)) | W(t) = A 



= U(A,Au{a(d A (a) + l)}). (4.5) 
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To prove this, we first observe that the conditioning event is 

{d A (c)-l d A (c) \ 

VceAB(c)+ X(c,j)<t<B(c)+Y, x ^j)\- 
3=0 j=0 J 

We proceed as in the previous section and condition on the values X(b, j) = x(b, j) > 
for b G A and < j < d A (b) — 1. We want this event to be a subset of = ^4}, 

so we require that the birth times of all b G A are at most t; that is, we must have: 

r-l 

V6i . . . b r G A y(6i ...b r ) = Y^ Yl x( - bl ■ ■ ■ bl >^ < t. (4.6) 

i=0 j=0 



Under this more stringent conditioning, the probability we wish to compute is 

F^y(adA(a))+X(a,d A (a)) = mmy(bd A (b))+X(b,d A (b)) b) , (4.7) 

where 



B = { V6 G A X(b, d A (b)) > t - y(b, d A (b))}. 

The exponential random variables in (|4.7j) are all independent. Moreover, by the 
lack of memory property, X(b, d A (b)) +y(b, d A (b)) — t conditioned on X(b, d A (b)) > 
t — y(b,d A (b)) is distributed as exp(f (d A (b))) . It follows that 

P (y(ad A (a)) + X(a, d A (a)) = mmy(bd A (b)) + X(b, d A (b)) \ B 

\ b&A 

exp(/((U(a))) - t = minexp(/(d A (6))) - t ) (4.8) 
where all exp's are independent. From the minimum property, this last probability 

M.M) (49) 



Y.KAtt&A{b)) 

and this holds irrespective of the values {x(b, j)}, as long as (|4.6I) is satisfied. As a 
result, (ji~oT) holds. □ 

Remark 4.4. The proof of Theorem \4-3\ makes it clear that the W(-) process is 
a continuous-time Markov Chain on £^ n up to time S. A consequence of this is 
the following. Let t > be given and let E be an event for W(-) that is entirely 
defined in terms of {W(s)}o< s <t • Assume that inside the event E, W(t) = A G £^ n 
. Finally, let F be an event defined entirely in terms of {W(s)} s >j. Then 

¥(F | E) =F(F | W(t) = A) 

We will employ this Remark in the proof of Lemma \6.1\ and Lemma \6.2\ below. 
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4.3 Three useful lemmas 



Before we move on to prove the main theorems in the paper, we collect three lem- 
mas (proven in the Appendix) that will be useful in dealing with sums of inde- 
pendent exponential random variables. The present lemmas provide estimates of 
several probabilities that are intimately related with the presence of nodes with k 
descendants in the final tree Too. All of them are key ingredients of the proofs of 
Theorem 1 1 . 1 1 and Theorem 11.21 We assume that f(x) = (x + l) p with p > 1 in all 
statements. 

Lemma 4.5 (A large-deviations bound). There exist constants C, no > de- 
pending only on f such that for all n > uq, all independent sequences of random 
variables {Xj = d exp(f(j))}j> n , and all 5 > 



Lemma 4.6. Let Y = Y\ + • • • + Y^ be a sum of k independent random variables 
for which 



Then there exist constants C, no depending only on k, f and the distributions Yi 
such that for all independent sequences of random variables {Xj = d exp(/(j ))},->„ 
that are independent of Y and all n > uq 



Lemma 4.7. Let Zi,...,Z^ be independent exponentials with mean one and let 
Z = Z\ + . . . + Zk. Then for all X > 



5 Finitely many k-fertile vertices 

In this section we prove the first of our main results about the GN process, Theo- 
rem As noted in the previous section, Claim li"2"l - which has not been proven 




VI < i < k P(Yi<e) = 6(e) as e \ 
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yet - is necessary for the connection between the exponential process and the GN 
process. Proving the Claim will require the a preliminary form of Theorem II . II that 
we shall present below. We assume throughout the section that f(x) = (x + l) p for 
some p > pk = 1 + l/k. 

Recall that a node is k-fertile if it has k or more descendants in the corresponding 
tree. 

Lemma 5.1. Consider the W(-) process defined in Section^ and assume its at- 
tachment kernel is f(x) = (x + l) p , p > Pk- Then, for all T > 0, 

E [#{a € W(«S) : a is k-fertile} \ V(e) <T]< +oo, (5.1) 

and therefore 

P (#{a G W(S) : a is k-fertile} < +oo | V(e) < T) = 1. 

Since V(e) < +oo almost surely, this implies that W(S) almost surely has only 
finitely many k-fertile vertices. 

As noted in Section |11 we will use Lemma 15.11 to prove Claim 14.21 and this in 
turn will imply that Theorem 14 . 31 holds . This last Theorem and Lemma 15 . 1 1 directly 
imply Theorem ll.il Therefore, most of the present section will be devoted to prov- 
ing Lemma 15.11 

This section is organized as follows. In Subsection 15. II we show how Lemma 15. II 
implies Claim l4~2l Having settled that matter, we move on to proving Lemma 15. II 
Our proof will consist of bounding the probabilities of the form 

P (a is /c-fertile | V{e) < T) , 

and then showing that their sum is finite. We illustrate our techniques for do- 
ing so in Subsection 15.21 below, where we show a partial result in the direction of 
Lemma 15. II We then show in Subsection 15.31 that the time at which a given a € N* 
becomes A:-fertile in the W(-) process can be bounded in terms of a sum of k ex- 
ponential random variables (Lemma I5.6|) . This permits an improved bound on the 
probability of fe-fertility (Subsection l5.4|) . which is then applied to prove Lemma l5~T1 
in Subsection 15.51 

5.1 Lemma 15.11 implies Claim 14.21 

Proof: [of Claim |4~2~] The following lemma is a well-known combinatorial result. 
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Lemma 5.2. [Konig's Infinity Lemma] Let T be an infinite rooted tree in which 
every vertex has finite degree. Then T contains an infinite path starting from the 
root. 

We will use the Infinity Lemma and Lemma l5.1l to prove a series of almost-sure 
statements that imply the Claim. 

All birth times are almost surely distinct. This occurs because, for all distinct 
a, b G N*, the difference B(a) — B(b) is a sum of terms of the form ±X(c,j) for 
some (c, j) G N* x NU {0}. Each such term has a smooth distribution with no point 
masses, and all terms are independent, hence B(a) — B(b) ^ with probability 1. 

There almost surely exists at least one vertex v G N* with infinite degree in 
W(5).For suppose that this were note the case. Since W(<S) is infinite, the Infinity 
Lemma would imply that there was an infinite path starting from the root in W(«S). 
But all the infinitely many vertices on such path would have > k descendants, for 
any k G N. However, p > 1 implies that p>pk = l + l/k for some k G N, and 
Lemma 15. II then implies that only finitely many vertices in W(5) can be /c-fertile, 
a contradiction. 

There almost surely exists a unique vertex v for which S = B(v) + V(v). With 
probability 1, there is a vertex v of W(«S) with infinite degree. Since the degree of 
v is infinite in W(5), all the children of v must have been born before time S. 

Vra G N, B{vn) < S. 

As n — > +oo, B(vn) — > B(v) + V(v), thus B(v) + V{v) < S. Then, by definition 
of S, S = B(v) + V(v). Thus there exists a v as claimed. For uniqueness, one can 
show that B{a) + V(a) / B(b) + V(b) for all distinct a, b G N*. 

Wji/i probability 1, W(i) is /im£e /or a// 1 < s. Suppose that is not the case. For 
all a G N* 

B(a) + P(a) = lim B(ad) > S > t, 

which implies that for all a there is an integer d a > such that B{ad a ) > t. There- 
fore, any a has finite degree < d a — 1 in W(t). By the Infinity Lemma, W(t) must 
then have an infinite path from e, ai, a±a2, aia2a%, .... But all nodes along this path 
have infinitely many descendants in W(i), and hence also in W(«S), which was shown 
above to have probability 0. The contradiction implies the assertion. 

The set of birth times before S can be well-ordered. This is a consequence of the 
previous assertion. 



17 



With probability 1, when the descendants of v are removed from W(S), the result 
is a finite tree. Again, the key property here is that all a 6 N*\{u} have finite de- 
gree. So if W(<S) without the descendants of v would be infinite, the Infinity Lemma 
would imply the existence of an infinite path in W(«S), which would imply that all 
nodes along the path have infinitely many descendants. Since this is impossible, the 
assertion must be true. 

The ordered birth times B$ = < B\ < £>2 < . . . are almost surely distinct and 
converge almost surely to S. That they are distinct follows from the first assertion. 
Since they form an increasing sequence bounded by S < +00, they converge to some 
finite limit. But the birth times {B(vm)}^^ 1 (with v as in the previous paragraph) 
form a subsequence of {Bn}^^^ that converges to B{v) + V{v) = S, so the {B n } n 
sequence converges to S as well. 

The series of assertions implies the Claim. □ 

5.2 Two instructive examples 

Having shown that Lemma 15.11 implies Claim W?2\ we now turn to the proof of the 
Lemma. Recall that the goal of that lemma is to prove that only finitely many ver- 
tices have k or more descendants in W(<S). For the sake of the reader, however, we 
first consider two special classes of a € N* and prove that only finitely many nodes 
in each class have large degree. While the corresponding general result combines 
ingredients of the two special cases below, we believe that our techniques become 
much clearer if introduced separately. 

To state the present results, we need two definitions. Fix a number L > 0, 
and call o £ N*\{e} L-moderate if all numbers in the sequence a are smaller than 
or equal to L. If on the other hand all numbers in a are bigger than L, call it 
L-extreme. Our two simple lemmas are presented below. 

Lemma 5.3. For all integers L > and all T > 0, the expected number of Im- 
moderate 1- fertile vertices in W(«S) conditioned on V(e) = T is finite. 

Lemma 5.4. There exists a constant Lq > defined only in terms of p such that 
for all integers L > Lq, the expected number of L-extreme vertices in VV(cS) that 
have at least k children is finite. 

Proof: [of Lemma l5.3j For any a = a\ . . . a m , the time for the birth of the first child 
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of a is 



B(al) = B(a) + X(a,0) 

m—l a j+i — 1 m 

= Y^ Yl X(a 1 ...a j ,i)+X(a 1 ...a m ,0) > JTX(ai ...Oj-,0). (5.2) 

j=0 i=0 j=l 

Notice that this lower bound on B{a\) is actually independent of V(e), which is at 
least as big as the tree explosion time <S. As a result: 

P (a is 1-fertile | V(e) = T) = P (B(al) < S \ V(e) = T) 

(m \ I m \ 

J2 X ( a ± •••a j ,0) < Vie) | V{e) = T J = P X( ai . . . a jt 0) < T . 



We now apply Lemma H77I with A = T and Z = X]j=i x i a i . . . a,j, 0) to deduce 
P(a is 1-fertile | P(e) = T) < P ( ^Xfa . . . Oj, 1) < T J < 



X[a x ...a u \) <T I < 3 

\i=l / 

There are L m L-moderate a of length m, and this implies that 

+°° rpmjm 

V P (a is 1-fertile I Vie) = T) < V — = e TL - 1< +oo. 

^— ' m! 
a L-moderate m_1 

This finishes the proof. □ 

Proof: [of Lemma l5.4| We assume L > k + 1, no, where no comes from Lemma 14.61 
Fix an L-extreme a = a\ . . . a m with all Oj € N, and let a m+ i = k. The event 

{a has at least k children} = {£>(afc) < 5} (5-3) 

is contained the event 

{fe-l +oo 1 

J2X( ai ... ai ,j) < ^X(ai...ai_i,s) I , (5.4) 
j=0 s=a, I 
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for each 1 < i < m. This is true because for all 1 < i < 



rn 



k-i 



B(ai . . . a,) + y~] X(ai ...CLj, j) = B(ai...a,ik) 

3=0 

(by properties of birth times) 

< B(ai . . . aidi+i) 

(since en > L > fc for i < to, and a m+ \ = k) 

< B{ak) 

(since ak is either a descendant of 
a\ . . . a,i + \ or equal to ak) 

and 

S < B(a 1 ...a i -i)+V(ai...a i -i) 
(by definition of S) 

+oo 

= B(ai . . .Qj-i) + y^_ j X(a 1 . . . aj-ij) 

3=0 

+oo 

= B(a 1 ...a i ) + ^2x(a 1 ...a i - 1 ,j). 

j=a-i 

So that 

{a has at least k children} 
= {B(ak) < S} 

(k-l +oo 
B(ax ...a,i) + ^X(ai . . . a i: j) < B{a x . . . a») + ^ X(a 1 . . . a,_i, j) 



j=o 



3=ai 



fc— 1 +oo 

^X(ai ...ai,j) < ^2x(a 1 ...a i - 1 ,j) 



j=o 



3=»-i 



a,i- 



Now note that all the events {-ff a ,i}i<i<m are i n f ac t independent. In fact, for any 
1 < i < n — 1, j depends only on the random variables X(ai . . . aj_i, j) with 
j>ai>L>k + l and X(ai . . . aj,£) with < ^ < fc. Therefore, the choice of L 
implies that no random variable can appear in the definitions of two different H a i . 
Therefore, 



' (a has at least fc children) < P {H a ^) . 



i=i 
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Now notice that 

F(H a>i ) =P Ij^Ye < ^X(a 1 ...a i _ 1 ,j) J , (5.5) 

V=l j>di J 

with Yg = X{a\ . . . ai, I — 1). It is straightforward to check that the assumptions of 
Lemma 14.61 hold (since we know a\ > L > no) and that as a result 

P (^)<^i^ ; M) 
of 

where C depends only on p, as the distributions of the Y^'s are determined by p. It 
follows that 

E P (a has k children) < ^ £ HE(Fi) = E ( Z^IFI ) ' 

a L-ext m=lai,...,a m >i!=l a i m=l 

(5.7) 

i 

Noting that p > p& — 1) > 1, we can now take L > Lq = (2C) «=(p-i)-i to have 

a finite sum. □ 

Remark 5.5. One can show by the same proof technique as above, that for all fixed 
v G N* and all fixed k € N 

¥ (vi has k children before v explodes) = O \ i~( p ~^ k j as i — > +oo. (5-8) 

To prove this, note that the event in 15. S\) is 

{fc-i 
^x(vi,k)<Y,x(v,j: 
3=0 3>i 

because B{vik) = B{vi) + Y^jZa X(vi, k) and B(v) + V{v) = B{vi) + J2j>t x (v,j) . 
TTien appZy Lemma \4-b\ as in the previous proof. 

Similarly, one can show that, for all t > 0, all v,w £ N* and aZZ finite trees T n 
such that v has n children in T n , 



P (w has k children after time t and before v explodes \ W(t) = T n 

= O (n-^ k 

We will employ this remark in the proof of Lemma \6.2L 



asn-> +oo. 
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5.3 Subtrees and the time until k descendants are born 



There are two reasons why Lemma 15 .HI and Lemma 15.41 do not imply Lemma 15.11 
First, there are a G N* that are neither L-moderate nor L-extreme. Second, the 
above lemmas only bound the probability of a certain node having degree > k, which 
is different from k- fertility for all k > 2. The next Lemma deals with the latter 
difficulty. Fix some a G N* and let W a (t) = {c G N* : B{ac)-B{a) < t} (for t € R) 
be the subtree of W(t + B(a)) rooted at a. Clearly, W a (-) and W(-) = W e (-) have 
the same distribution. Moreover, a is /c-fertile if and only if the size of W a (S — B(a)) 
is at least k + 1 (i.e. W a («S — H(a)) has at least vertices other than the root). 
Lemma 15 . 61 provides tools for the analysis of the A;- fertility event. 

Lemma 5.6. For a fixed a G N* ; let To (a) be the time of the first birth of a node 
other than the root in the W a (-) process. Moreover, for i G N, let %(a) be the time 
elapsed between the ith and (i+ l)th births in W a (-) (again excluding the birth time 
of the root). Then there exist a sequence of random variables {TZj(a)}^^ such that: 

1. {7^j(a)}j=o * s a se Q uence of independent random variables; 

2. the sequence {^"(a)}.-^ ^ s a deterministic function of the random variables 
{X{ac,i) | c G N*, i G N}; 

3. for each j G N U {0}, T^-(a) = d exp((j + l)f(j)); 
4- 7Zo(a) = To (a) and for all j G N TZj(a) < Tj(a). 

Proof: It suffices to consider the case a = e. For convenience, we introduce the 
notation 

YZx (H c = a . . . c m G N*\{e} 
c = e 



Sfc) = 



We prove inductively that the random variables {1Zj(e)} r j =0 can be defined as above, 
so that for all j G N U {0} TZj(e) is completely defined by the values of X(c,r) for 
c G N*, < S(c) + i < r. For r = 0, this is easy: just set Ko(e) = 71(e) = X(e,0). 
Now assume inductively that Hj(e) has been defined for all < j < r = n — 1. To 
prove that the same is possible for r = n, condition on a particular value 

>V £ |j^T}(e)j = A G S,\A\ < +oo. (5.9) 

^j=i T}(e) is exactly the birth time of the nth descendant of the root in VV e (-) (for 
e is born at time 0), hence \A\ = n + 1. We also notice that, S(c) + d^(c) < n for 
all c £ A. Indeed, the sequence 



cdA(c) if cU(c) > 
c ifd A (c)=0 
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is an element of A with £(&) = S(c) + cIa(c), and it is a simple fact (whose proof 
we omit) that E(6) < - 1 for any 6 G ^4 G £ fin . 

Conditioned on the event in 1)5.9)) . the random variable T n (a) has exponential 
distribution with rate ^ceA /(^-a( c ))j which is bounded by |A|/(n) < (n + l)f(n) 
by the above remarks. Therefore, 

n n (a) = ^A^^l Tn (a) < T n+1 (a) j where .1 

is exponential with rate (n + l)f(n) irrespective of A, by the multiplication prop- 
erty of exponentials (cf. Subsection 12. Because W a (^"~ 1 1 Tj(a)) and 7^(a) are 
completely defined by the random variables {X(ac, j) : c G N*, S(c) + j < n + 1}, 
the same is true of 1Z n (a). This finishes the proof. □ 




5.4 A general bound on the probability of /c-fertility 

Lemma f5.6l is now used to prove a stronger form of the bounds in Subsection l5.2l that 
applies to alia G N* (and not just L- moderate or L-large sequences). To present this 
bound, we need a definition. For a fixed L > and a sequence a = a\a2 ■ ■ ■ a m G N* 
of length m, the set of small indices in a is smj^a) = {1 < i < m : ai < L}, and 
the set of large indices in a is lg^(a) = {1 < i < m : > L}. 

Lemma 5.7. There exist constants C, Lq > depending only on k and p such that 
for any T > 0, L > Lq and a = a\ . . . a m G N* 

maxiT C\ m -r-r 1 
P(a is k-fertile and Vie) <T)< , { | . Wl ■ (5.10) 

Proof: For most of the proof, we will only assume that Lq > k; more conditions on 
Lo will be imposed later. Set a m+ \ = k and for each i G lg^(a) define Ii to be the 
smallest j G lgi(a) U {m + 1} satisfying j > i; notice that the choice of L implies 
ai i > Lq > k whenever ij < m + 1 . Employing the random variables {7£j(a)}^~Q 
whose existence Lemma IS~^1 guarantees, we deduce that 

fc-l 



{a is fc-fertile} C {B(a) + ^ ft„(a) < S} (5.11) 

n=0 

fe-1 

= {B(a)+X(a,0) +Y J ^n(a) <S}. 



n=l 
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In what follows, we will bound the probability on the right-hand side, noting that 
the 7£j(a)'s and X(b, i)'a that appear in the definitions below are all independent 
because of Lemma 15.61 Consider the following events. 



fc-l 



B(o) + X(a, 0) + n n{a) < S and V(e) <T , (5.12) 



n=l 



G„ 



^ X( ai ...Oi,0) <T 
iesm L (a) 



(5.13) 



X(ai...ai,0) \ ^ 

+ £ i= l*(ai---a/ i -i,j) y ^ 

(i G lg L (a), J* ^ m + 1), 



(5.14) 



fc-l 



X(a 1 ...a i ,Q)+J2K j (a) < Y, X ^ 
(i € lg L (a),Ii = m + 1). 



(5.15) 



The first event is the one whose probability we want to bound. The second event 
is similar to the one in the proof of Lemma 15.31 whereas the remaining events are 
reminiscent of those in the proof of Lemma 15.41 We now claim that: 



Claim 5.8. It holds that 



FjcG^n p| H a 



(5.16) 



Moreover, the events on the right-hand side of $5.16]) are independent. 



Claim 15.81 is proven at the end of the current proof, but we now present the 
following concrete example of its application to illustrate our argument. Assume 
that k = 2, L = Lq = 3 and a = a\a2 ■ ■ ■ «6 = 142461, in which case sm/,(a) = 
{1,3,6} and lg^(a) = {2,4,5}. Figure [21 represents some of the random variables 
involved in 1)5.16)) by rectangles. The first six columns of rectangles stand for random 
variables of the form X(b,j) for b = e (the empty string), 1, 14, . . . , 14246, and j = 
0, 1, ... ,6, while the last column represents the random variables 1Zq{o) = X(a,0) 
and !Z\{a). The rectangles that lie completely below the dashed line correspond to 
the random variables that appear in 
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Figure 2: Illustration of Claim l5~£l for a = 142461. 



3 i 
B(a) = X(e,0) + ^X(l,ix)+ ]Tx(14,j 2 ) 

ii=o j 2 =o 

3 5 

+ X(142, j 3 ) + Yl ^(1424,j 4 ) + X(14246,0). 

J3=0 J4=0 



(5.17) 



Moreover, 



4, 
5, 
7. 



2 
4 
5. 



By checking the definitions of G^ and -f/^i, one can check that the following state- 
ments hold. 

1. 3 Fj , since in the event Fj the explosion time S is at most T, and the sum 
(|5.17|) defining B(a) + X(a, 0) contains the terms of X^esm L (a) ^( a i> • • • a «> 0)- 
Moreover, the random variables appearing in G^ correspond to the rectangles 
marked with triangles in Figure O 

2. H a> 2 12 Fj . In order for Fj to happen, a\a2 ■ ■ .ai 2 = 1424 must be born before 
a\ = 1 explodes. In particular, using the critical fact that k = 2 < L = 3, so 
that ai 2 > k, 1422 = a\a-i . . ■ aj 2 ^\k must be born before node a% = 1 explodes 
. Since ai explodes at time <B(ai) +P(ai) = B(l) + X(1,0) + X(1,1) + 
1422 is born at a time that is larger than #(14) + X(14,0) + X(142, 1) and 
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0(14) = B{1) + X(l, 0) + • • • + X(l, 3), it follows that 



F T 

B(1) + X(1,0)+ 
C { I +--- + X(1,3)+ | <0(1)+X(1,O)+X(1,1) + 
X(U, 0) + X(142, 1) 

C ^X(14,0)+X(142,l) <^X(l,i) ^ =F a , 2 , 

i>4 

so -ff a ,2 5 -F 1 ^ is indeed true. Moreover, one can check that the random 
variables appearing in the definition of H a ^ are precisely the ones marked 
with circles in Figure |2j 

3. Similarly, one can show that H a ^ D Fj (respectively, H a ^ D F^) and that 
the random variables marked with stars (resp. squares) are precisely the ones 
appearing in the definition of H a ^ (resp. H a ^). 

Items 1., 2. and 3. above imply not only the validity of (|5.16|) . but also that no 
random variable of the form X(-,--) orTZ.(a) appears in the definition of more than 
one of the events in \5.13\) - \5.15\) . Since those random variables are also indepen- 
dent, we have proven that G^, H a ^, H a ^ and H a ^ are independent events, which 
implies the Claim in this special case. The proof of Claim I5~%1 for general L, k and 
a is entirely analogous to the argument sketched above. 

We continue with the proof of Lemma 15.71 noting that Claim 15.81 implies 

P(if) <P(C#) x [] P(fla,i). (5-18) 

The remainder of our proof consists of bounding the probabilities on the right-hand 
side of (|5. 18f) , which is done in roughly the same way as in Lemmas 15.31 and 15.41 
The probability of G^ is bounded using Lemma 14. 71 with the Z[s corresponding to 
the X{a\ . . .ai,0) for i E sm^a) and A = T. 

T \sm L (a)\ 

Now fix some i € lgi(a) with F ^ m + 1. We apply Lemma 14. HI with Y\ = 
X(a x . . .Oi,0), Ye = X(a 1 . ..a h -xJ- 1) (2 < I < k) and 

{Xj}j>n = {X[ai . . . Oi_i, j)}j> ai - 
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In the present case, the distributions of the Y/s are all defined in terms of / and k. 
Therefore there exist C, no depending only on k and / such that if cij > no, 



C 



(p-l)k- 



(5.20) 



For i G lg_L(o) with Jj = m + 1, a similar reasoning with = lZi{a) for 2 < I < k 
implies that for (possibly enlarged) C, no depending only on k and /, and all en > no, 
|5jZUJ) still holds. So if we take L > n , we can plug (|5TT§|) and lf5~2U)) into (fB~TH)) 
for any a, which finishes the proof. □ 

To conclude, we now prove Claim 

Proof: [of Claim !5.<Sj We first show that show that each of the events that are 
(re) defined below 



H„ 



G, 



X( ai ...a h 0) <T 

iesm L (a) 



X{ai...ai,Q) \ ^ 

(i € lg L (a), J< / m + 1), 



(5.21) 



(5.22) 



H a ,i = ^ X(ai . . .Oi,0) +y^^j(a) < ^ ^"(ai • • • «i-i, j) 
(* € lg L (a), J< = m + 1), 



contains 



fc-i 



(5.23) 



B(a)+X(a,0) +^2Hj(a) < S and 7>(e) < T \ . (5.24) 
j=i J 

First containment: D Fj. On the one hand, all terms appearing in the 



sum 



X(ai... ai ,0) 

iesm L (a) 

also appear in the sum defining B(a) = B{a\ . . . a m ) (cf. I|4.1|) ). so that 

fc-i 

Y X( ai ...a h Q) <B(a) <B(a) + X(a,0) + J2 n j( a )- ( 5 - 25 ) 



iesm L (a) 



3=1 
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On the other hand, by the definition (|4.4|) of S 

S < B{e) + V(e) = P(e) since B(e) = 0. (5.26) 

Therefore, 

Fj occurs => X( a l • • • «i, 0) < V(e) < T =^> G„ occurs . 

i6Sm L (a) 

Second containment: H a ^ D Fj if i G lgx,(a) and ij < m + 1. Consider the 
sum 

fc-i 

X(ai . ..a h 0) + ^X(ai . ..aj._i,j) 

3=1 

In the present case, Ii G lg^(a). Our choice of L > k is now used, for it implies that 
aj. > L > k, and hence 

fc-l o/j-l 

X(ai . . .Oj,0) + ^X(ai . . . a^-ij) < X{a\ . . . aj,0) + ^ X(ai . . .a^-i, j). 
j=i i=i 

The terms in the above sum each appear once in 

m— 1<H+1 — 1 fc— 1 

£ J] X(ai...a t ,i)+X(a,0)+^^(a) (5.27) 

fe-i 

= B(a) -B(ai...ai) + X(a,0) + ^7^(a) 

3=1 

and it follows that 

fc-l 

X(oi . . . Oj, 0) + ^ X(ai . . . a Jt -i,j) (5.28) 

3=1 

fc-l 

< B(a) - B{ai ...(*) + X(a, 0) + ^ Kj(a). 

3=1 

Therefore, 

fc-l 

r„ occurs 

3=1 
fc-i 

=► X( ai ... ai ,0) + J2 X ( a i ■ ■ ■ oii-i. J') < «5 - B(oi . . . a,). (5.29) 
3=1 
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But it is always true that 

S < B(ai . . . Oi_i) + V{a\ . . . dj_i) 

and 

B(ai . ..<H-i) + Viflx . . . Oj_i) - B(ai . . . Oj) = y~] X{a x . . .a^i, j). 

3><H 

Hence 

Fj occurs (5.30) 

k-1 

=> X(ai . . .Oi,0) + ^X(ai . . . a h -i,j) < ^ X{a\ . . . cti-i, j) 
H a j occurs. 

Third containment: H a>i c Fj if i G lgz,(a) and Jj = m + 1. In this case, 
the terms of the sum 

k-i 

X( ai ...ai,0) + J2Kj(a) 
i=i 

are all contained in 

m—lo, t +\ — l k—1 

X( ai ...a t ,j)+X(a,0) + Y,^j(a) (5.31) 

t=i j=0 j=l 

k-1 

= B{a) - B{a x ...<*) + X(a, 0) + ^ TZj(a). 

3=1 

The rest of the proof proceeds exactly as in the case of the second containment. 

We now show that the events in Q5.21JI to (|5.23|) are independent. This is proven 
by showing that no term X(b, r) appears in the definition of more than one of those 
events. We will analyze three different cases. 

Comparing to the remaining events. is entirely defined in terms of 
X(a\ ■ ■ ■ at, 0) for t S sm^a). The only terms of the form X(b, 0) appearing in the 
definition of the events H a ^ have b = a± . . . for i € \g L (a). This implies that no 
random variable appears in the definition of both G^ and H a> i, for all i S lgiX a )- 

Comparing H a ,i to H a j for i < Ii < £, i,£ e lgi(a). The definition of H ati 
only involves random variables of the form X{a\ . . . at,j) for some j € N U {0} and 
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t < Jj — 1 < I — 1, whereas the definition of -£f a ^ involves JT(ai . . . a s , j) for s > 1. 
Therefore, the ranges of the indices t and s will never overlap in this case. 

Comparing H a ^ to H a ^ for i < Ii = £, i,£ e lgi(a). By the same argument 
and with the same notation as above, the only "possibility for trouble is when 
t = Ii — 1 = £ — l = s. This is precisely where the assumption that L > k comes in. 
The event involves random variables of the form 

{X(ai...a/ 4 _i,i):l<j<fc-l}, (5.32) 

whereas the event H a ^ uses the random variables 

{X(ai...a(,_i,j):j >ag}. (5.33) 

Since i € \g L (a), an > L > k, the ranges of j in the two formulae above do not 
overlap, and we are done. □ 



5.5 Proof of Lemma 15.11 

Having proven Lemma 15.71 we now come to the end of the proof of Lemma 15.11 
Proof: [of Lemma l5.1j Our aim is to show that for any T > 0, 

P (a is Ai-fertile | V(e) < T) < +oo. (5.34) 

aGN* 

To this end, we employ Lemma 15.71 and prove instead that for some fixed number 
L > Lq depending only on /, k and T, 



^ \sm L (a) ! 11 „(p-!) fc v ' 



We will eventually choose some L such that 

sm L (a ! J-J- &»-i)* 

which clearly implies (|5.35() . Fix some m and a subset 5 C {1, . . . , m} of size \S\ = s. 
The sum of the above quantities over all a of length |o| = m with sm^a) = S 1 is 

max{r,Cr tt V 1 

s ! 11 2^ (p-l)fc' 

ie{l,...,m}\Soi>i a i 
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because there are L s ways of choosing the Oj's with j E sm^a). Now note that S 
can be chosen in (™) for any < s < m, and therefore 

max{C, T} m -n I 



E 



n 



|sm L (a)|! „ 

, Hr-?. ielg L (a) a j 
|smi(o)|=« 



(p-i)fc 



(5.37) 



I m "i-s 



m\ max{C, T} T 

s) 7l 11 (p-l.A- 



n 5Z fp-i) 



< 



< 



s ) si 



dx 



rpm 

/p\s—a(m—s) 



where a = k(p — 1) — 1 and 



T = max{C, T} x max < 1, — > . 

I a J 

Here we make critical use of the condition p > p^ = 1 + 1/k: under this assumption, 
a > 0. Summing over s, we discover that 

<--<SSi> " 



E 



max{C, T} T 



1 



|sm L (a)|! 11 „&>-!)* 

|a|=m ' V 71 ,j'Glg L (a) a i *=0 



Si 



(5.38) 



To bound this last sum, we split it into two parts, corresponding to s < am/2{\ + 
a) and s > am/2(l + a). For the first part, we forget the s! term and bound 
s — (m — s)a < —am/2; for the second, we simply bound s! > [om/2(l + a)]! and 
s — (m — s)a < m. 



E 



2(l + a) 



E 



s> 



77? 



777 



rpm ps—(m~s)a 



pm ps—(m—s)a 



< 



< 



2(l+c0 



2(l + a) 



2(l+a) | • s >2lfq^ 



It follows that for L > (4T) 2 / a , which only depends on /, p and T, 



E 

\a\=m 



Si 



< 2 m — + 

— 1 





m 




am 


! 




2(l+a) 





-n(m) 



as m-> +oo. 



(5.39) 
(5.40) 



This proves (|5.36j) and finishes the proof. □ 



31 



6 The structure of the infinite tree 



Now that the proof of Theorem 11.11 is complete, we proceed to prove Theorem 11.21 
We will assume throughout the section that f{x) = {x + l) p (with p > 1) and that 
k = k p is as in the statement of the Theorem. As in the previous section, it is 
convenient to break the proof down into steps. 

Lemma 6.1. If S is a rooted tree with \S\ =1 + 1 vertices, then for all a E N* 



Lemma 6.2. Let T be any finite tree and v be a vertex of T. There is a positive 
probability that all of the following events hold: 

1. the labelled GN process reaches state T; 

2. v is the unique vertex present in T to have any children after state T is reached; 
and 

3. all nodes that are born after state T is reached are I- fertile for some i < k. 
As we shall see below, these lemmas permit that Theorem 11.21 is easily proven. 

Proof: [of Theorem II. 2j By Claim 14.21 there almost surely exists a unique node 
ueN* with B(y) + V(v) = S, and all other nodes have finitely many descendants 
in Tqo. Moreover, since p > 1 + l/k p , one can apply Theorem 1 1 . 1 1 and deduce that 
with probability 1 there are only finitely many children vn of v that are fc-fertile. 

If we remove all other children of v (i.e. those that have < k — 1 descendants, 
which must be infinitely many) and their descendants from Tqo, we obtain a finite 
tree T. We claim that in fact T ro = Glue(T, v, k). For consider some (rooted, 
oriented) tree S with \S\ < k. by Lemma ldl( there almost surely exist infinitely 
many n € N such that W V n(V(v) — Y^ij=o i s isomorphic to S, and because 



S = B{v)+T{v) = Bivnj + iV^-^ZoXivJ)), this implies that W m (S-B(yn)) 



is isomorphic to S for infinitely many n. But W vn (S — B{vn)) is the subtree of 
Too = VV(«S) rooted at (and oriented towards) vn, hence with probability 1 there 
are infinitely many n € N such that the subtree of T^ rooted at vn is isomorphic to 
S. This is true for any S of size < k, so all such trees must appear infinitely often, 
and finishes the proof of the claim. 

We have shown that Tqo is always isomorphic to some Glue(T, v, k). Moreover, 
Lemma l6.2l savs that any Glue(T, v, k) has a positive probability of being the value 
of Tqo. This finishes the proof. □ 

We now proceed to prove to prove Lemmas I6.1l and l6.2l 



P (#{n G N : Wan{V{a) + B[a) - B(an)) is isomorphic to S} = +oo) 




(6.1) 
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6.1 Proof of Lemma 16.11 



Proof: [of Lemma I6.1| The p > pi case is implied by Theorem II. 1[ so we focus on 
p < Pi, using "~" to denote a rooted oriented tree isomorphism. We will prove the 
theorem only for the case a = e. This entails no loss of generality because the joint 
distribution W a (-),V(a) and {B(an) — B(a)} ne ^ does not depend on the choice of 
a <E N*. 



Define the sequence of events 

B n = {W n (V(e)-B(n))^S} (n € N). (6.2) 
Our goal is to show that 

P (B n infinitely often) = 1. 

If the events B n were independent, we could apply the Borel-Cantelli Lemma for 
independent events to prove this statement. Since independence is lacking, we will 
substitute the events B n by a sequence of independent events A n such that 



' (A n i.o. but not B n i.o.) 



0. 

+oo. 



(6.3) 
(6.4) 



neN 



Because the sequence {A n } consists of independent events, equation ()6.4|) implies 
that A n infinitely often almost surely, which implies (via equation (j6,3jl ) that B n 
infinitely often almost surely. Therefore, (|6,3|) and (|6.4j) imply the Lemma. 



We define the sequence A n as follows 

1 3 

A n = t Vt <E 



2(p - l)nv- 1 ' (p - l)^- 1 



W n (t) ^S} (neN). 



(6.5) 



The independence of those events is a consequence of the independence of the pro- 
cesses {W n (-)} n gN- Moreover, 



'(-A n i.o. but not B n i.o.) 

< P (V(e) - B{n) 



i.o. . (6.6) 



2(p - l)^- 1 ' 2(p - l)nP- 1 

We claim that the event on the RHS of H6.6|) has probability 0. To see this, note 
that 

V(e)-B(n) = Y J X(e,j) 

j>n 
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is a sum of independent, rate-/(j) exponentials, and 



E[V(e)-B(n)) = J24-y 

j>n J ^ J> 

As a result, direct use of Lemma 14.51 and the estimate 



]=n 



(j + 1)P (j) - l)^" 1 



(n » 1) 



implies 



^P(^)-B(n)0 



2(p- l)nP~ 1 ' 2(p- l)^- 1 



< +00. 



Therefore, the Borel-Cantelli Lemma implies that 

r 1 3 

V(e) - B(n) [ 2(p _ 1)nP _i. 2(p _ 1)nP _i 

thereby proving the claim and (via (|6.6[) ) equation (|6.3() , 



1.0. 



(6.7) 



It remains to prove (|6.4|1 . For this purpose, we will only need a very rough lower 
bound on the probability of A n . Consider a labelling of the elements of S. That is, 
pick a finite parent-closed subset of N*, i.e. an element S € £ fin , that corresponds 
to a labelling of the vertex set of 5 as defined in Subsection 13.21 We assume that S 
is ordered 

5 = { s (°) = e , s ( 1 ),... s W} (6.8) 

in a way such that for all 1 < i < £, there is an index pi < i such that is the 
parent sequence of s". We also define the subsets 



S(i) = {s w ,...,s w } (1 < t <£) 

The ordering property implies that S(i) is also a parent-closed subset of N*. Now 
define (for 1 < i < £, where applicable): 



Cnii) 



2(p - VjnP- 1 
3 



2(p - l)^- 1 ' 

j s 1 -^ is the only vertex born in {W(t) }^ c rj_i^ *t n ] } 
{no vertex is born in {W(t)}i e[4niTn ] } . 



(6.9) 
(6.10) 

(6.11) 
(6.12) 
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Clearly, 



'(A„)>p(c n (i)) x mp c n 



i=2 




PCnCr)). (6.13) 

r=l / 



(In fact, A n is defined in terms of W ra (-) rather than W(-), but in terms of evaluating 
the probabilities that does not make any difference since these two processes have 
the same distribution.) We will lower bound the probabilities on the RHS of the 
above inequality. 

Probability of C n (l). The probability of C n (l) is the probability that the 
birth time of is B(s^) < t n /t and that no other birth occurs in the time 
interval [B(s^),t n /£]. Conditioning on a value < B(s^) = t < t n /£, the time of 
the next birth in W(-) is 



mm 



{X(s^,0),X(s^,l)}= d exp(/(0) + /(!)). 



Hence 



C n (l) | B(s^) = u ) = P I exp(/(0) + /(!)) > 



u 



> e -(/(0)+/(l))^-« > e -(2/(2))(^)_ (g_ 14 ) 



Moreover, 



B( 8 (°)) < ^ 



1 - e 



-/(o)% 



Since i n — > as n — > +oo, it follows that there exist constants C\,n\ > such that 
for all n > rii 



We conclude that 



P (V 0) ) < y) > Cl*n- 

p^u^c^e- 2 ^. 



(6.15) 



Probability of C n (i), 2 < i < i. In this part, we will make use of the 
Markov property of the continuous-time process (cf. Remark I4.4j) . Notice that 
the conditioned event is defined entirely in terms of {W(s)}o< s <(i_i)t n /^, whereas 
C n (i) is defined entirely in terms of {W(s)} s >( i _ 1 )f n /f . Moreover, it is also true that 
inside the event n^^CV^j) 



W 



1 



t n ) =S(i-l). 



(6.16) 
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Therefore, we can apply Remark 14.41 to deduce 



P C n (i) 



i-1 



3=1 



P C n (i) 



w 



i - 1 



t n )=S(i- 1) 



(6.17) 



For C n (i) to happen, two conditions must be satisfied. 

1. B(*W) - (i - l)i„/^ < t„£ That is, sW must be born in the interval 

[(i-i)i n /*,»t„/4 

2. No other birth happens in the interval [(i — l)t n /£,it n /£]. 
Choose a value < u < t n /£. We will now bound 



C„(i) 



(< - 1) ^^^(t-l) ) B(.W)= u +( < - 1 ^ 



(6.18) 



In this case, note that the rate at which the first birth of a node a ^ happens 
in S(i - 1) is 2 



Ri-x = Y, f( d s(i-i)( sU) )) < (i + !)/(» + 1) 



(6.19) 



0<j<i, j^Pi 



the inequality being justified by the fact that the cardinality of S(i — 1) is i. The 
rate of births after time (i — l)t n /£ + u under the conditioning of (|6.18l) is 



0<j<i 



(6.20) 



Under the conditioning in ()6. 18() . C n (i) holds iff no a ^ is born in the time 
interval [(i — l)t n /£, (i — l)t n /£+u] and no births happen in [{i — l)t n / £+u,it n / £}. By 
the Markov property of W(-), these events in different time intervals are independent 
given W((i — l)t n /£ + it) = S(i). Therefore, we can write 



C n (i) 



(i - l)t n 

£ 



S(i-1), B(sW) = u + 



(i - l)t r 



= P (exp(-Ri_i) > u) x P (exp(Ti) > t n /£ - u) 

= e --Ri-i« e -Ti(^-u) > e -(i+l)/(i+l)^. ( 6 .2l) 

2 This is the rate uniii some birth happens, whether it is the birth of a or of some s^' ^ a. 
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As a result 

P (c n (i) 

> e -(i+l)/(i+l) 



w _ (j - l)t ra t» / £-1 



S(i-1) . 

(6.22) 



Now notice that conditioned on W((i — l)t n /£), 
Hence 

P (b(* W ) - < | U (^y^) = " 1)) 

= i _ ^/(d*,.^))* > ! _ e -/W^. (6 . 23) 

To state our bound for the probability C n (i), we note that t n — > as n — > +oo, and 
therefore there exist constants Ci,rtj > depending only on ^ and / such that for 
all n > m 



P C n (i) 



w 



= S(i - 1)) > e-t'+D/ti+^Qt, 



(6.24) 



Probability of D n . For this bound, we again use the Markov property of 
W(-). Notice that whereas D n is only defined in terms of {W(i)}t>t„, the definition 
of Ui<eC n (i) only depends on {W(i)}o<t<t n • Moreover, inside the latter event, 
W(t n ) = 5*- We can then apply Remark 14.41 to conclude 



P A, 



|J^n(i 



W(t„) = 5 . 



(6.25) 



i=l 



Under this last conditioning, the rate of new births in W(i n ) is 

£/(d s (a)) <(* + !)/(* + !) 



and the probability that none of those births occur in [t n ,T n ] is precisely 
P (A* |>V(t n ) = £) = e -(^-*«)E seS /(d s (s)) > e -(T n -t n )(e+i)f(i+i) _ 



(6.26) 
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Wrapping up. To finish this proof, we plug (jHHJ), jOH) and (|6~2l)|) into (|6TT3|l . 
letting n > max{n, : 1 < i < £} and C = C1C2 . . . Cg 

W(A n ) > C^n e_(m)/(m)if ) x e-( T "-*")( £+1 ^ +1 ) x ^ (6.27) 
> Cexp(-(*+l)/(* + l)T n )£. (6.28) 
For £ fixed, n — > +00, we deduce (using the definition of t n and T n in (|6.9|) . l|6.1Ujl ) 

p(^.) = n(4)=n(n-^). 

Now the assumption p < p£ comes into play, for it implies that (p — 1)1 < 1. As a 
result 

^P(A„) = £n(n-^)=+oo. 

n>l n>l 

This proves ()6.4|) and finishes the proof. □ 

6.2 Proof of Lemma 16.21 

Proof: [of Lemma I6.2j Let T have u + 1 vertices and let v have r children in T. 
Let (T, v) + n denote T with n additional children added to v. Asymptotically in n 
we consider the probability that Too is not isomorphic to Glue(T, v, k) conditional 
on the GN process reaching (T,v) + n. Each w 6 T, w ^ v, has probability o(l) 
of having a child before v explodes. Each of the n additional children of v has 
probability 0{n k<yl ~~ p ^) of having k (or more) descendants before v explodes. For 
i > n + I the i-th child of v has k (or more) descendants before v explodes with 
probability O^^ 1- ^). The total probability of any of these events occurs is then 
bounded from above by u ■ o(l) + n ■ 0(n fe(1 ~ p )) + J2 i>n+ i 0(i k<yl ~ pS) ) which is o(l) 
because k(l — p) < —1. We can therefore find an explicit n so that this probability 
is less than, say, ^. 

With positive (perhaps small) probability the first n + / steps of the GN process 
yield (T, v) + n. Then with probability at least ^ the final is Glue(T, v, k) as 
desired. □ 

7 Conclusion 

The two main theorems of this paper completely characterize the limits of the super- 
linear GN process. Some of their consequences are the fact that the tree T^ has 
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finite height (and thus the finite-time GN trees have bounded height), and that the 
nodes of in-degree < k — 1 (where p > pk) are all but finitely many. However, these 
characteristics raise many interesting questions about distributions of the above 
quantities. For instance, what does the tail of the height distribution of look 
like? We believe that the methods presented in this paper might be sharpened to 
prove this and other results. 

There are many more open questions about the p < 1 case of GN. The authors 
of have derived some results on their modified model for this range of p under 
the assumption that certain limits exist. Proving unconditional results of this na- 
ture for the GN model remains an important open problem that is also potentially 
amendable to treatment by our techniques, since the exponential embedding applies 
to any attachment kernel. 

It would also be quite interesting if the exponential embedding could be used to 
prove known and new properties of related network models, in particular the original 
Barabasi- Albert preferential attachment model. The rigorous version of the process 
defined in is essentially the GN process defined in our paper with attachment 
kernel f(x) = x + 1, and it could be the case that the embedding method is a viable 
technical alternative to the "linearized chord diagrams" of 9 . 

A Appendix — proofs of technical lemmas 

Proof: [of Lemma 14. 5 j We will only prove the first inequality, for the proof of the 
second one is very similar. The technique we employ is fairly standard and is 
commonly used in other proofs of Chernoff-type large deviation inequalities Pj . Let 
A n = Ylj> n Xj— /GO • Fix any < s < (n+l) p /2 and notice that, by the standard 
Bernstein's trick, the formulae in Subsection 12.11 the inequality "1 + x < e xv , and 
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some simple calculations 
F(A n > 5) 



e sA n > e sS 



-sS 



3>n 



e i x >-7m) 



j>n 7U) 

e~ sS x 

2_ I S S 

e 7U) 1 + + 

j>n \ 



< exp 



2s' 



(2p - l)n 2 P" 1 



s6 



To finish the proof, we set s = n p 1 ^ 2 , which is permissible since riP l / 2 < (n + l) p /2 
for all large enough n. □ 

Proof: [of Lemma l4.6| To begin with, we note that 

k k 

Ve > Yl p (*i < |) < F (Y < e) < JJP (Y* < e) 



i=l 



i=l 



and therefore the assumptions imply the existence of a constant Co depending only 
on the distributions of the Y,-'s and on k such that 



Ve > 0, — < P (Y < e) < C e fc 

We now use the notation and results in the proof of Lemma 14,51 with 5 = n 3//4_p . 
Then 



Vra > no, 



>5 \ <2Cie 



i 
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for some constant C\. Then 



'(X<n-S) 



3>n 



> 5 



<p y<^i 3 - < P (Y < n + 5) + 

j>n 



3>n 



> 5 



and by the previous bounds 



Cn 



2Ci e" n3 < P [ Y < E ] < C o(v + 5 ) k + 2C i e ~ n 



The result now follows from the fact that, as n — » +oo 



i i 

1 ■ - -ni 



(p - l)nP 



— ~ /i ± 5 » e" 



□ 



Proof: [of Lemma l4.7j P (Z < A) is equal to the probability that there are at least A: 
arrivals up to time A in a Poisson process with rate 1. This has a Poisson distribution 
with rate A; hence we have the exact result 



(z<\)=y,< 



The upper bound follows from 



j=k 



< 



A fc e 



^ ^ A^' A^ ^ > 

J. = ~k\^ Q JkTT)(k + 2)...(k + e) -= k\ 



k„\ 



□ 
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